Automatic Resource-Centric Process Migration for MPI
Abstract
Process migration is the ability to move a running process from one node to another, where it continues executing. The MPI standard prescribes support for process migration, but so far it has been implemented mostly via checkpoint-restart. This paper presents an automatic, transparent process migration framework for MPI processes. The framework is advantageous when migrating individual processes, for purposes such as load balancing, is more appropriate than checkpointing the whole job. The paper describes the framework for process migration in clusters and multi-clusters, how it was tuned for Open MPI, and the performance of migrated MPI processes.
Similar resources
Hector: Automated Task Allocation for MPI
Many institutions already have networks of workstations that could be harnessed as a powerful parallel processing resource. A new automatic task allocation system has been built on top of MPI, an environment that permits parallel programming using the message-passing paradigm and is implemented as extensions to C and FORTRAN. This system, known as "Hector", supports dynamic mig...
G-JavaMPI: A Grid Middleware for Transparent MPI Task Migration
Resources in a grid are dynamic, heterogeneous, and widely distributed. End users need a simple and efficient way to aggregate and utilize these diverse resources. We introduce a grid middleware called G-JavaMPI, which combines a high-level message passing interface with the Java language to support portable message-passing programming in a grid. Different from traditional MPI implementations...
Charm++ & MPI: Combining the Best of Both Worlds
MPI and Charm++ embody two distinct perspectives for writing parallel programs. While MPI provides a processor-centric, user-driven model for developing parallel codes, Charm++ supports work-centric, overdecomposition-based, system-driven parallel programming. One or the other can be the best or most natural fit for distinct modules that constitute a parallel application. In this paper, we prese...
MOSIX Evaluation on a Linux Cluster
Multicomputer Operating System for Unix (MOSIX) is a cluster-computing enhancement of the Linux kernel that supports preemptive process migration. It uses adaptive resource-sharing algorithms that migrate processes across a cluster to achieve scalable high performance. Message Passing Interface (MPI) is a library standard for writing message-passing programs, which has the advantage of portabili...
Network-centric performance analysis of runtime application migration in mobile cloud computing
Mobile Cloud Computing alleviates the limitations of resource-constrained mobile devices by leveraging the cloud resources. Currently, software-level solutions, also known as computational offloading, migrate the cloud-based mobile applications at runtime to the cloud datacenter to optimize the application execution time. However, the application execution frameworks mainly focus on migrating t...